Introduction

 

Maize (Zea mays L.) is the most important food, feed, industrial raw material and energy crop in the world today, which plays an important role in ensuring world food security, economic development and alleviating energy crisis (Zhao et al. 2016; Zhao et al. 2018). With the continuous improvement of people's living standards, maize breeding has gradually developed in the direction of high yield, high quality and stress resistance (Zhao et al. 2016). Therefore, increasing corn yield is of long-term significance to promote the safe and healthy development of grain in our country.

Nuclear factor NF-Y (Nuclearfactor-Y, NF-Y) is a ubiquitous transcription factor in eukaryotes, which is composed of NF-YA, NF-YB and NF-YC subunits and plays an important role in physiological processes such as plant development and environmental stress response (Yuan 2017). At present, reports on maize NF-Y transcription factors are rare, so it is necessary to analyze the transcription factor family to further understand the role of this transcription factor (Stelpflug et al. 2016). In this study, bioinformatics methods were used to analyze the NF-YB subfamily of nuclear factor NF-Y, including NF-YB expression analysis, evolution analysis, conservative domain analysis, conservative element prediction, gene structure prediction and gene location on chromosome, in order to further understand the function and role of NF-YB transcription factors in maize. The bioinformatics analysis of maize NF-YB family can lay a foundation for in-depth analysis of the function of maize NF-YB transcription factors, and provide a theoretical basis for further using NF-YB transcription factors in maize breeding and improving maize yield.

 

Materials and Methods

 

Plant material

 

From the plant transcription factor database Plant TFDB (http://planttfdb.cbi.pku.edu.cn/), the nucleic acid and protein sequences of maize NF-YB transcription factor family gene and Arabidopsis NF-YB transcription factor family gene for correlation analysis of this study was downloaded (Liu et al. 2015). The name of maize was JKY818, which offered by Jilin Agricultural Science and Technology University, and its seed germination was 99%.

 

Test method

 

Acquisition of NF-YB transcription factor sequence: NF-YB sequences of Arabidopsis thaliana and maize were downloaded from the plant transcription factor database Plant TFDB (http://planttfdb.cbi.pku.edu.cn/). The NF-YB protein sequences of maize and A. thaliana obtained in the above database include proteins translated by multiple transcripts (transcriptionisoform) with different NF-YB genes. In this study, the longest transcripts (and their encoded proteins) of NF-YB genes with multiple transcripts were analyzed. Thus, thirteen NF-YB gene coding regions and their protein sequences of A. thaliana and nineteen NF-YB gene coding regions and protein sequences of maize were obtained.

Construction of phylogenetic tree of transcription factor NF-YB-like gene: Using Clustal X software to carry out multiple matching analyses on the protein sequences of maize and Arabidopsis. Based on the comparison results, using MEGA5.05 to build neighbor joining NJ, set Bootstrap as 1000 repetitions and other as default parameters (Zhao et al. 2015). The deletion data and Poisson model were used to analyze the NF-YB gene in maize.

Analysis of conserved domains of amino acid sequences of NF-YB transcription factor family proteins: Protective domains of amino acid sequences of NF-YB proteins in maize were predicted by using Pfam domain pattern redrawer function in tbtools software.

Location of NF-YB Gene on chromosome: According to the published information of maize genome B73 RefGen_v4, the distribution of 17 NF-YB transcription factor genes on 10 maize chromosomes was determined. The specific location of NF-YB gene on chromosome was found by BLSAT comparison in Maze GDB database (https://www.maizegdb.org/).

Analysis of physico-chemical properties of amino acids: The number of amino acids, molecular weight, theoretical isoelectric point, number of aliphatic amino acids and hydrophobicity of proteins were analyzed by online Protparam tool provided by ExPASY (https://web.expasy.org/protparam/) (Xie et al. 2014).

Sub-cellular localization of NF-YB: Plant-mPLoc(www.csbio. SJTU. Edu. CN) was used to predict the location of 17 NF-YB transcription factor family proteins in cells (Zhu et al. 2012).

Expression analysis of NF-YB family gene: Using the published transcriptome data of maize tissues at different developmental stages, the expression patterns of 17 NF-YB transcription factor family genes in different maize tissues and developmental stages were analyzed. Heat map through the Amazing Heatmap function in TB tools was drawn.

 

Results

 

Identification of NF-YB transcription factors

 

According to the NF-YB proteins identified in the plant transcription factor database Plant TFDB, for the proteins translated by different transcripts of the same NF-YB gene, the protein with the longest amino acid sequence was selected for the following analysis. Thirteen NF-YB proteins of A. thaliana were obtained, which were AT2G37060.1, AT3G53340.1, AT2G38880.8, AT2G13570.1, AT4G14540.1, AT5G47640.1, AT2G47810.1, AT1G09030.1, AT1G21970.1, AT5G47670.1, AT2G27470.1, AT5G08190.1 and AT5G23090.2, respectively. Nineteen maize NF-YB proteins were obtained, which were GRMZM2G064426, GRMZM5G804893, GRMZM2G303465, GRMZM5G866699, GRMZM2G478501, GRMZM2G480621, GRMZM5G809663, GRMZM2G444073, GRMZM2G180947, GRMZM2G384528, GRMZM2G473152, GRMZM2G169884, GRMZM2G152822, GRMZM2G167576, GRMZM2G11789, GRMZM2G124663, GRMZM2G012654, GRMZM2G146286 and GRMZM2G147712, respectively. The NF-YB genes of the above 17 maize were named NF-YB-1, NF-YB-2, NF-YB-3, NF-YB-4, NF-YB-5, NF-YB-6, NF-YB-7, NF-YB-8, NF-YB-9, NF-YB-10, NF-YB-11, NF-YB-12, NF-YB-13, NF-YB-14, NF-YB-15, NF-YB-16 and NF-YB-17, respectively.

 

Phylogenetic analysis of NF-YB Gene in Maize and A. thaliana

 

Twenty eight A. thaliana NF-YB genes and 49 maize NF-YB genes were selected and the protein sequences of NF-YB transcription factor family were compared by ClustalW and the phylogenetic tree was established. NF-YB transcription factors in maize and Arabidopsis could be divided into five groups, which were named Group I, Group II, Group III, Group IV and Group V respectively (Fig. 1). We found that there were 7 members of maize in Group I, which were GRMZM2G064426, GRMZM5G804893, GRMZM2G303465, GRMZM5G866699, GRMZM2G478501, GRMZM2G480621 and GRMZM5G809663. A. thaliana had three members, including AT2G37060.1, AT3G53340.1 and AT2G38880.8. In Group II, maize and A. thaliana each had three members, and maize were GRMZM2G444073, GRMZM2G180947 and GRMZM2G384528 respectively. A. thaliana were AT2G13570.1, AT4G14540.1 and AT5G47640.1. In Group III, maize and A. thaliana had 2 members respectively, maize were GRMZM2G473152 and GRMZM2G169884 respectively. A. thaliana had two members, which were AT2G47810.1 and AT1G09030.1. In Group IV, maize had four members, which were GRMZM2G152822, GRMZM2G167576, GRMZM2G11789 and GRMZM2G124663. A. thaliana had two members, which were AT1G21970.1 and AT5G47670.1 respectively. In Group V, maize had three members and A. thaliana had three members respectively. Maize were GRMZM2G012654, GRMZM2G146286 and GRMZM2G147712 respectively. A. thaliana were AT2G27470.1, AT5G08190.1 and AT5G23090.2 respectively. Among the entire group, maize gene encoded protein accounted for the largest proportion in Group I.

Analysis of conserved domains of NF-YB transcription factor family protein sequences

 

Fig. 1: Phylogenetic evolution of NF-YB genes in maize and A. thaliana

 

Conservative domain refers to a kind of highly conserved domain in the process of biological evolution. Analysis of the conserved domain of the amino acid sequence of NF-YB protein in maize showed that all NF-YB transcription factors in maize contained a CBFD_NFYB_HMF domain, in which GRMZM2G011789, GRMZM2G167576 and GRMZM2G012654 contained a Histone domain, which coincided with their CBFD_NFYB_HMF domain (Fig. 2), while Histone represented a histone binding domain, which may be related to the epigenetic signals such as histone modification read by the above NF-YB transcription factors. GRMZM5G866699 contains a CCT domain. In the CCT domain, it contains a nuclear localization signal and a zinc finger structure that mediates the interaction between the protein and the protein, which means that the protein may form a complex with other transcriptional regulatory factors.

 

Location of NF-YB gene on chromosomes

 

Based on the information of maize genomic B73_ref_v4, we determined the length of 17 NF-YB transcription factor family genes (Table 1), and analyzed their distribution on 10 maize chromosomes. The 17 NF-YB transcription factor family members are unevenly distributed on 10 chromosomes of maize. Chromosome 1 contains NF-YB-5 and NF-YB-16; Chromosome 2 contains NF-YB-10 and NF-YB-14; Chromosome 3 contains NF-YB-9,NF-YB-11 and NF-YB-13;Chromosome 4 contains NF-YB-4 and NF-YB-6;Chromosome 5 contains NF-YB-1;Chromosome 6 contains NF-YB-15;Chromosome 7 contains NF-YB-2, NF-YB-7 and NF-YB-12;Chromosome 8 contains NF-YB-3 and NF-YB-17;Chromosome 9 contains NF-YB-8;While chromosome 10 does not contain this transcription factor family gene (Fig. 3).

 

Analysis of physicochemical properties of NF-YB transcription factor family proteins

 

The amino acid composition and physicochemical properties of different NF-YB transcription factor family proteins are different, and the amino acid number, molecular weight, theoretical isoelectric point, fat coefficient and hydrophilic average coefficient are quite different among different NF-YB transcription factors. As shown in Table 1: Length of NF-YB gene

 

Gene

Locus

Gene location

NF-YB-1

GRMZM2G011789

Chr5: 204332559..204333737

NF-YB-2

GRMZM2G012654

Chr7:68977358..68979646

NF-YB-3

GRMZM2G064426

Chr8:117802830..117806690

NF-YB-4

GRMZM2G124663

Chr4:164097202..164098323

NF-YB-5

GRMZM2G146286

Chr1:219379297..219385327

NF-YB-6

GRMZM2G147712

Chr4:72846053..72852624

NF-YB-7

GRMZM2G152822

Chr7:16191777..16192698

NF-YB-8

GRMZM2G167576

Chr9:36849525..36850583

NF-YB-9

GRMZM2G169884

Chr3154415632..154416136

NF-YB-10

GRMZM2G180947

Chr2210388035..210389391

NF-YB-11

GRMZM2G303465

Chr3:182417996..182420272

NF-YB-12

GRMZM2G384528

Chr7:164694041..164695458

NF-YB-13

GRMZM2G478501

Chr3:21149128..21167654

NF-YB-14

GRMZM2G480621

Chr2:565394..566331

NF-YB-15

GRMZM5G804893

Chr6:152187992..152193678

NF-YB-16

GRMZM5G809663

Chr1:106417592..106419191

NF-YB-17

GRMZM5G866699

Chr8:169431514..169439728

 

 

Fig. 2: Conserved domains of NF-YB amino acid sequence in maize

Table 2, on the whole, NF-YB transcription factor family proteins in maize are rich in acidic amino acids, most of the isoelectric points are in the acidic or weakly acidic range, and only the isoelectric points of NF-YB-3, NF-YB-9 and NF-YB-15 are in the alkaline range. The average hydrophilic coefficient of NF-YB transcription factor family proteins in maize is only negative but not positive, indicating that the transcription factor family proteins are hydrophilic proteins, not hydrophobic proteins. Fat coefficient can be used as an index of protein stability. Generally, the higher the fat coefficient is, the higher the protein stability is. In this study, it was found that the basic fat coefficients of maize NF-YB transcription factors were all in a high range (Table 2), which inferred that the stability of maize NF-YB transcription factor family proteins was good.

 

Subcellular localization of NF-YB transcription factor

 

We use the online tool Plant-mPLoc (http://www.csbio.sjtu.edu.cn/bioinfo/plant-multi/) to predict subcellular localization (Zhu et al. 2012). NF-YB transcription factor family proteins are located in the nucleus (Table 2). Results show that NF-YB transcription factor family proteins are all concentrated in the nucleus and may have the function of transcriptional regulation.

 

Expression analysis of NF-YB in maize

 

Table 2: Analysis of physical and chemical properties and subcellular localization of NF-YB transcription factor family proteins

 

Gene

Locus

Amino acid number

Molecular weight

Theoretical iso electric point

Fat coefficient

Average coefficient of hydrophobicity

Predicted location

NF-YB-1

GRMZM2G011789

303

31934.87

6.39

50.46

-0.552

Nucleus

NF-YB-2

GRMZM2G012654

200

22131.34

5.29

65.05

- 0.071

Nucleus

NF-YB-3

GRMZM2G064426

205

21873.84

7.90

53.85

-0.678

Nucleus

NF-YB-4

GRMZM2G124663

289

30935.69

6.42

50.10

- 0.656

Nucleus

NF-YB-5

GRMZM2G146286

326

36720.67

5.60

66.07

- 0.898

Nucleus

NF-YB-6

GRMZM2G147712

322

36489.49

5.35

69.07

-0.878

Nucleus

NF-YB-7

GRMZM2G152822

222

24065.38

4.97

66.85

-0.526

Nucleus

NF-YB-8

GRMZM2G167576

287

30875.87

6.39

51.71

-0.673

Nucleus

NF-YB-9

GRMZM2G169884

191

21329.08

7.96

54.76

-0.879

Nucleus

NF-YB-10

GRMZM2G180947

230

24652.02

6.08

45.48

-0.690

Nucleus

NF-YB-11

GRMZM2G303465

189

20647.56

6.13

54.23

-0.735

Nucleus

NF-YB-12

GRMZM2G384528

237

24987.31

6.59

43.76

-0.635

Nucleus

NF-YB-13

GRMZM2G478501

225

25177.29

4.51

67.64

-0.762

Nucleus

NF-YB-14

GRMZM2G480621

116

13232.26

5.56

58.88

-0.825

Nucleus.

NF-YB-15

GRMZM5G804893

203

21699.68

7.90

54.43

-0.667

Nucleus.

NF-YB-16

GRMZM5G809663

130

14752.12

5.58

74.31

-0.580

Nucleus.

NF-YB-17

GRMZM5G866699

855

94913.87

5.09

61.04

-0.694

Nucleus.

 

 

Fig. 3: Location of NF-YB transcription factor family genes on different chromosomes of maize

Using the transcriptome sequencing data of maize tissues at different developmental stages published by Stelpflug et al. (2016), we analyzed the expression of 17 NF-YB genes in different developmental stages of maize, and drew a heat map (heatmap) based on the FPKM values of each gene at each developmental stage. The tissues analyzed included germinated seeds at different stages, different regions of roots, seedlings, stems in different positions, apical meristems of stems, leaves, internodes, spikes, anthers and filaments (maize whiskers). The selective expression of NF-YB transcription factor family genes in different tissues and developmental stages in maize (Fig. 4). It has been found that overexpression of GRMZM2G012654 (NF-YB-2) can accelerate cell division and elongation, and promote the growth of taproot (Ballif et al. 2011). In the analysis, it was found that the expression of GRMZM2G064426 (NF-YB-3), GRMZM2G303465 (NF-YB-11), GRMZM5G804893 (NF-YB-15), GRMZM2G146286 (NF-YB-5) and GRMZM2G147712 (NF-YB-6) was very similar to that of GRMZM2G012654 (NF-YB-2), so we speculated that the function of these five NF-YB transcription factor family members was similar to that of GRMZM2G012654 (NF-YB-2). What is worth our most attention is that the expressions of GRMZM2G011789 (NF-YB-1) and GRMZM2G124663 (NF-YB-4) are relatively low during the whole growth and development period, and only at a high level at the early stage of embryo formation, indicating that these two family members play a certain role in embryo formation. In addition, we also found that the overall expression of the six family members of GRMZM2G169884, GRMZM2G152822, GRMZM5G866699, GRMZM5G809663, GRMZM2G478501 and GRMZM2G480621 is relatively low, we speculate that these six members are highly expressed under biotic or abiotic stress, thus regulating the growth and development of maize under stress.

 

Discussion

 

 

Fig. 4: Expression map of NF-YB gene in different tissues of maize

Transcription factors played an important role in the growth and development of plants and had response to changes in the external environment, which were also the key links in the regulation of various physiological activities (Yu et al. 2016; Yu et al. 2018). Nuclear factor Y (Nuclearfactor-Y), referred to as NF-Y, is a transcription factor that binds to the cis-acting element of CCAAT-box and regulates the expression of target genes (Zhuang et al. 2009; Yuan, 2017). It is common in eukaryotes such as yeast, plants, animals and so on (Liu et al. 2016). Plant nuclear transcription factors are encoded by multiple genes, mainly including nuclear transcription factor A subunit family (NF-YA) (CBF-B or HAP2), B subunit family (NF-YB) (CBF-A or HAP3) and C subunit family (NF-YC) (CBF-C or HAP5) (Romier et al. 2003). Existing studies have shown that plant NF-Y transcription factors play an important role in plant embryonic development, photosynthesis, flowering time regulation and stress response, but the related studies are mainly carried out in A. thaliana, the function of this transcription factor in maize has not been reported (Cai et al. 2007; Chen et al. 2007).

In this study, the characteristics of NF-YB family genes in maize were analyzed by bioinformatics analysis. We screened nineteen NF-YB sequences from maize NF-YB transcription factor family, which is more than the number of A. thaliana, indicating that the expansion of the number of NF-YB genes may be the evolutionary characteristics of maize to better adapt to the environment. All 17 transcription factors contain the same conserved domain CBFD_NFYB_HMF, and individual sequences also contain CTT domain and histones, indicating that the function of this transcription factor family is diverse. Comparing the physical and chemical properties of NF-YB transcription factor family proteins, the transcription factor family proteins in maize are rich in acidic amino acids, most of the isoelectric points are in the acidic range, and the average hydrophilic coefficient of transcription factor family proteins is only negative but not positive, indicating that the that NF-YB transcription factor family proteins are not very complex. In subcellular localization, it is found that NF-YB transcription factors are mostly located in the nucleus like traditional transcription factors. There is a close relationship between gene expression pattern and function. Different genes in NF-YB family are expressed differently in different tissues and periods, in which GRMZM2G064426 and GRMZM2G303465 are highly expressed in different tissues and periods, indicating that they play an important role in the whole growth and development of maize. However, GRMZM2G011789 and GRMZM2G124663 are only highly expressed during embryonic development, indicating that they play an important role in the regulation of embryonic development and maturation. In summary, this study laid a theoretical foundation for further exploring the gene function of maize NF-YB family.

Thus, the results of this study promote the understanding of NF-YB family genes and lay a foundation for the functional study of NF-YB family genes at the molecular level.

Conclusion

 

The NF-YB transcription factor family in maize was identified by bioinformatics tools, which provided some basic data for further utilization of NF-FB transcription factors in maize breeding.

 

Acknowledgements

 

The authors acknowledge the Science and Technology Innovation Development Project of Jilin Province of China (#20200402025NC to L.J.), Science and Technology Innovation Development Project of Jilin City (#202031781 to L.J.), Science and Technology Innovation Development Project of Jilin City (#202031729 to L.J.), Natural Science Foundation of Jilin Province of China (#20180101233JC to Z.-Y.X.).

 

References

 

Ballif J, E Saori, K Mitsuru, MA Jennifer, YJ Wu (2011). Over-expression of HAP3b enhances primary root elongation in Arabidopsis. Plant Physiol Biochem 49:579‒583

Cai XN, B Jenny, E Saori, D Elizabeth, MX Liang, C Dong, D Dewald, J Kreps, T Zhu, YJ Wu (2007). A putative CCAAT-binding transcription factor is a regulator of flowering timing in Arabidopsis. Plant Phyiol 145:98‒105

Chen NZ, XQ Zhang, PC Wei, QJ Chen, F Ren, J Chen, XC Wang (2007). AtHAP3b plays a crucial role in the regulation of flowering time in Arabidopsis during osmotic stress. J Biochem Mol Biol 40:1083‒1089

Liu C, XX Li, YL Su, YF Guo (2015). Genome-wide identification, phylogenetic analysis and expression profiling of the SBP transcription factor family in Nicotiana tobacum. Chin Tob Sci 36:1‒11

Liu J, ZZ Xu, N Yuan, Y Guo, BL Zhang, JC Du (2016). Genome-wide analysis of NF-YB gene family in Gossypium hirsutum L. Acta Agric Boreal-Sin 31:21–27

Romier C, F Cocchiarella, R Mantovani, D Moras (2003). The NF-YB/NF-YC structure gives insight into DNA binding and transcription regulation by CCAAT factor NF-Y. J Biol Chem 278:1336‒1345

Stelpflug SC, RS Sekhon, B Vaillancourt (2016). An expanded maize gene expression atlas based on RNA sequencing and its use to explore root development. Plant Genomics 9:8594

Xie T, S Wang, L Huang, X Wang, LP Kang, LP Guo (2014). Transcriptome-based bioinformatics analysis of Arnebia euchroma ERF transcription factor family. Chin J Trad Chin Med 39:4732‒4739

Yu XM, LL Jiang, R Wu, XC Meng, A Zhang, N Li, Q Xia, X Qi, JS Pang, ZY Xu, B Liu (2016). The Core Subunit of a Chromatin-remodeling complex, zmchb101, plays essential roles in maize growth and development. Sci Rep 6; Article 38504

Yu XM, XC Meng, YT Liu, N Li, A Zhang, TJ Wang, LL Jiang, JS Pang, XX Zhao, X Qi, MS Zhang, SC Wang, B Liu (2018). The chromatin remodeler ZmCHB101 impacts expression of osmotic stress-responsive genes in maize. Plant Mol Biol 97:451465

Yuan GC (2017). Identification of Potato NF-TB Family Genes and Functional Characterization of StNF-YB3.1 in Potato (Solanum tuberosum L.), Inner Mongolia University, China

Zhao JR, RH Wang, XX Liu (2016). The current situation of maize industry and the development trend of biological breeding in China. Biol Bus 3:45‒52

Zhao JR, S Wang, M Li, HY Lv, DW Wang, YQ Ge, X Wei, WC Yang (2018). Current status and perspective of maize breeding. J Plant Genet Res 19:435‒446

Zhao JL, WJ Yao, SJ Wang, YB Jiang, BR Zhou (2015). AP2/ERF gene family in Populus trichocarpa by bioinformatics. J Northeast For Univ 43:21‒29

Zhu S, L Zeng, PZ Wu, YP Chen, HW Jiang, GJ Wu, MR Li (2012). Cloning and sequence analysis of the cDNA of plastid-located glycerol-3-phosphate acyltransferase (JcGPAT2) gene from Jatropha curcas L. J Guangdong Agric Sci 39:1‒5

Zhuang J, R H Peng, F Gao, XY Fu, B Zhu, XF Jin, J Zhang, AS Xiong, QH Yao (2009). Cloning and analysis of AP2/ERF-B1 Subfamily transcription factors from Brassica napus L. Huyou 15. J Nucl Agric Sci 23:435‒441